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Abstract 

We show that the following variation of the single-source shortest path problem is NP- 
complete. Let a weighted, directed, acyclic graph G = (F, E, w) with source and sink vertices 
s and t be given. Let in addition a mapping / on i? be given that associates information 
with the edges (e.g., a pointer), such that /(e) = /(e') means that edges e and e' carry the 
same information; for such edges it is required that w{e) = w{e'). The length of a simple st 
path U is the sum of the weights of the edges on U but edges with /(e) = /(e') are counted 
only once. The problem is to determine a shortest such st path. We call this problem the 
edge information reuse shortest path problem. It is NP-complete by reduction from 3SAT. 


1 The Edge Information Reuse Shortest Path Problem 

A weighted, directed, acyclic graph G = (F, E, w) with source and sink vertices s,t G V is given. 
Edges represent some possible substructures and a path from s to t determines how substructures 
are put together to form a desired superstructure. Edge weights reflect the cost of the associated 
substructures (e.g., memory consumption). An ordinary shortest path determines an ordered, 
tree-like representation of the superstructure of least cost: a single root with children given by 
the edges of the path. Now, different edges in G may represent similar substructures, and if 
several such edges occur on a path, the cost of the substructure need be paid only once. We 
want to find a shortest such path from s to t where the weight of edges with similar information 
is counted only once. There may easily be such a path of less cost than an ordinary shortest 
path in G. In that case, a more cost-efficient representation of the desired superstructure is by 
a compressed tree where several children are represented by the same substructure. We model 
this problem by associating information with the edges in the form of a function f on E that 
may for instance be a pointer to the substructure represented by the edge. Two edges e,e' G E 
carry the same information and are similar in that respect if /(e) = f{e'). For the optimization 
problem to be well-defined, it is required that edges with /(e) = f{e') have the same weight 
w{e) = w{e') (alternatively, the edge with the smallest weight could be counted). Weights and 
pointers are positive integers. 

We call this problem the edge information reuse shortest path problem. The decision version 
is formalized as follows. 

INSTANCE: Given a weighted, directed, acyclic graph G = {y,E,w), two vertices s,t G V, 
edge information function /, integer A > 0. The length r{U) of a(n ordered) path U with edge 
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information reuse is defined as 


r{U) = 

e€U\{eGU\3e'<e:f{e')=fie)} 

where e' < e means that edge e' occurs before e on U. 

QUESTION: Is there an st path 17 in G with length r{U) < K1 

The edge information reuse shortest path problem is NP-complete, as the next section will 
show. 

2 Reduction from 3SAT 

Originally, a reduction from PARTITION was given, which turned out to be incorrect and 
fundamentally flawed (see Appendix). Instead, we use the same reduction from 3SAT [2l L02] 
as in the paper by Broersma et al. [T]. 

Proposition 1 The edge information reuse shortest path problem is NP-complete. 

Proof: It is clear that the problem is in NP. Let U he a path from s to t. We can check 
whether the path taking edge information reuse into account has length at most some K by two 
traversals of U. We use /(e) as a pointer to mark for information reuse. In the first traversal, 
all /(e) for e € [7 are marked as unused. In the second traversal the weights of unused /(e)’s 
are summed and /(e) marked as used. 

The 3SAT problem is, given a collection of m three-literals clauses Cj,0 < j < m over n 
variables Xj, 0 < z < n, to determine whether there is a true/false assignment to the n variables 
that satisfies the instance by making all m clauses true. 

Let an instance of 3SAT be given. For each variable Xi and each clause Cj a weighted 
subgraph is constructed, and these subgraphs are joined one after the other. All edge weights 
will be either 0 or 1 (not actually corresponding to truth values). The first node of the subgraph 
corresponding to xq is taken as the source s, and the last node of the subgraph corresponding 
to Cm-i is taken as the sink t. Any st path will have to pass through all variable and all clause 
subgraphs. The part of the path passing through the variable subgraphs will correspond to an 
assignment of truth values to the variables Xj. Edges in the variable subgraphs corresponding 
to Xi will share information with edges in the clause subgraphs where a literal corresponding to 
Xi appears. The construction will have the property that there is an information reuse shortest 
path of length n if and only if the 3SAT instance is satisfiable. 

The subgraph for variable Xj consists of four vertices Uj, Uj+i for 0 < i < n. The 
source vertex is s = uq. There are edges {ui,u[) and {ui,u'f) of weight 1, and edges (u',Ui+i) 
and (h(, ttj+i) of weight 0. A path going through vertex n' will assign true to Xi, whereas a path 
going through vertex will assign true to the negation Xj. The edges {ui,u'f) and {ui,u[) will 
share information with edges in clause subgraphs. 

The subgraph for clause Cj consists of five vertices Uj, nj, Uj+i for 0 < j < m. The 
sink vertex is the last vertex t = Vm- The last vertex Un of the variable subgraph is connected 
to the first vertex vq of the clause subgraph by an edge {un,vo) of weight 0. In each clause 
subgraph, there are three edges of weight 0, namely {Vj,Vj^i), {Vj,Vj+i), {Vj,Vj+i). Likewise 
there are three edges of weight 1, namely {vj,Vj), {vj,Vj), {vj,Vj) which share information with 
edges from the variable subgraphs. Information sharing is as follows: f{{ui,u[)) = fi{vj,Vj)) 
if literal A:,0 < A: < 3 of clause Cj is Xj, and f{{ui,u^)) = f{{vj,Vj)) if literal k,0 < k < 3 of 
clause Cj is x^. 
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The graph thus constructed has 0{n + m) vertices and edges, and can clearly be computed 
in a polynomial number of steps, including the function / assigning (shared) information with 
the edges. 

Now, the claim is that the given 3SAT instance is satisfiable, if an only if the shortest st 
path with edge information reuse has length exactly n. Any path from s to t must have length 
at least n, since it has to pass through all vertices Ui^l < i < n + 1 via an edge with weight 1. If 
the 3SAT instance is satisfiable, a path of length n where the weights of edges that share infor¬ 
mation with previous edges are not counted can be constructed as follows. The path through 
the variable subgraph is given directly by the truth assignment (if Xi is true, pass through 
if Xi is true, pass through it'). For each clause subgraph, at least one literal must be true, and 
choosing the corresponding edge for the path costs nothing on the path length since this edge 
can share information with the edge of the literal. Conversely, if there is an edge information 
reuse shortest path of length re, this path shows that a truth assignment exists that satisfies all 
clauses since it must pass through the clause subgraph through edges that share information 
with edges (set to true) in the variable subgraph. □ 

The graph constructed in the reduction is acyclic, which shows that the shortest st path 
problem with edge information reuse on acyclic graphs is NP-complete. 

3 Remark 

Alex Khodaverdian (personal communication, 24.5.2016) pointed out some problems with the 
original reduction from PARTITION (still retained in the appendix), which eventually showed 
this reduction to be fundamentally flawed. Fortunately, the 3SAT reduction from [T] shows the 
problem NP-complete, so that the original claim still holds. It might still be enlightening to 
find a different reduction. 

For the first version of this note, Gerhard Woeginger (personal communication, 15.9.2015) 
pointed out that NP-completeness follows straightforwardly from the NP-completeness of the 
problem of finding a simple st path with the smallest number of colors in a graph with colored 
edges, and supplied the reference [T]. Simply let the function / denote the edge colors and take 
all edge weights to be 1. A shortest st path with edge information reuse corresponds to a simple 
st path with the smallest number of colors. 
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A The flawed reduction from PARTITION 

We tried to show the edge information reuse shortest path problem NP-complete by a reduction 
from the NP-complete PARTITION problem [21 SP12]. 

Proposition 2 The edge information reuse shortest path problem is NP-complete. 
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Figure 1: The weighted, directed, acyclic graph constructed from the given PARTITION in¬ 
stance. The shortest path taking edge information reuse into account should have length B if 
and only if the PARTITION instance is feasible. 


It is clear that the problem is in NP. Let U he a path from s to t. We can check whether the 
path taking edge information reuse into account has length at most some K by two traversals 
of U. We use /(e) as a pointer to mark for information reuse. In the hrst traversal, all /(e) for 
e G [/ are marked as unused. In the second traversal the weights of unused /(e)’s are summed 
and /(e) marked as used. 

Now, let the item set A = {xq, ... Xn-i} with integer item weights w{xi) be an instance of 
PARTITION, and let B = w{x). We construct a weighted, directed, acyclic graph G such 

that there is a shortest path with edge information reuse of length B if and only if the answer to 
PARTITION is “yes”, that is, if there is a subset P of A such that J2x&p '>x{x) = J2x&A-p w{x). 

The graph G consists if three parts, the first with vertices uo,ui,... ,Un and vi,... Vn-i, 
the second with vertices Uq,u'i, ... and the third part with vertices Uq, u'l,... ,u'^, where 
Un = u'o and = Uq. The source is s = uq and the sink t = u'.^. For each i = 0,n — 1 there 
are edges (ui,Ui+i), and each with weight w{xi), and these three edges 

carry the same item information, that is /((uj, Uj+i)) = = f {{u", u"^i)). Different 

edges {ui,Ui+i) and {uj,Uj+i) carry different item information. The information associated 
with these edges designate the items of A. There are edges {uo,Un) and (ttg,u(j) both of weight 
[P/2j (which share no information). In the first part of G there is for each i = 1,... ,n — 1 
two edges {ui,Vi) and (ui,tti+i) of weight 0. The construction is illustrated in Figure [H It can 
clearly be carried out in 0{n) operations. 

There is clearly a path from s to t in G. The purpose of first part of the graph is to select 
items for the set P. For any path through this part the information associated with non-zero 
weight edges designate items chosen for P. Each subset P is chosen by a unique path. 

Assume that a shortest st path with information reuse is longer than B. The path either 
passes through the vertices tti, n 2 ,..., u„_i, and possibly some of the Vi vertices as well, or 
takes the shortcut edge {uo,Un). 

In the first case, the length of the path to vertex Un is at most [P/2j, and there is no 
possibility for information reuse since all edge information occurs for the first time in this part 
of G. Let W be the sum of the weights of the (ui,Ui+i) edges on the path. These weights 
will not be counted again since they share information with the edges (u(,u'^^) and {u'l 
If kF < [P/2j the shortest path must take the shortcut edge (uo,u(j) of weight [P/2j since 
B — W > [P/2j. The path through the final part of G from Uq to t will have weight W — B 
for a total of P -I- [P/2j. 

If the shortest path instead takes the shortcut edge {uo,Un) of weight lB/2\ (which means 
that there is no item Xi € A with w{xi) < [P/2j), then the path must also take the shortcut 
edge (uQ,tt(j) since there is no edge information to reuse and the non-shortcut path from Uq 
to u'.^ has length P. The part of the path from Uq to t in this case has cost P since no edge 
information can be reused, for a total of 2[P/2j -|- P. 
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Thus, the length of a shortest path with information reuse is B if and only it is possible to 
find a path from uq to of length exactly [-B/2J (that does not use the shortcut edge), and 
if the weight of the edges whose information has not been used in this part is likewise exactly 
[-B/2J. In this case, all edge information has been used, and the part of the path from ttg 
to t contributes weight 0. The solution P to the PARTITION is the information /(e) on the 
non-zero edges on the part of the path from s to Un- 

The flaw in this argument is simply that subpaths of a shortest st-path do not have to be 
shortest paths themselves. Therefore, with information reuse, there is always a shortest path of 
length exactly B, and the shortcut edges of weight L-^/2j do not have to be taken. 
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